AMPLIFICATION OF NUCLEIC ACIDS
CROSS-RELATED APPLICATIONS
In yet other embodiments, the nucleic acid template is selected from the
group consisting of genomic DNA, cDNA, plasmid DNA, DNA [...]

These and other aspects of the invention will become evident upon
reference to the following detailed description and attached drawings. In addition,
various references are set forth below which describe in more detail certain procedures
or compositions. Each of these references is incorporated herein by reference in its
entirety as if each were individually noted for incorporation.

BRIEF DESCRIPTION OF THE DRAWINGS AND SEQUENCE LISTING

Figures 1A and 1B are photomicrographs depicting the electrophoretic
[...] various test fragments [...] according to [...].

Figure 2 is a photograph depicting the electrophoretic screening
data for the amplification of the CGG triplet repeat in Fragile X Syndrome.

Figure 3 is a photograph of amplification of cDNAs from a small 
number of cells and tissues. M, molecular weight markers; JTP, cDNA from activated 
Jurkat T cells; J1000, cDNA from 1000 Jurkat cells; J5000, cDNA from 5000 Jurkat 
cells; Ear1, cDNA from human inner ear; Ear2, cDNA from human inner ear.
SEQ ID No. 1 is a DNA sequence of the M13 forward primer.

SEQ ID No. 2 is a DNA sequence of the M13 reverse primer.
SEQ ID No. 3 is a DNA sequence of MHCclnl, which is derived from 
the MHC gene region. 

SEQ ID No. 4 is a partial DNA sequence of human OCT-T1 cDNA. 
SEQ ID No. 5 is a DNA sequence of human OCT1 cDNA.

SEQ ID No. 6 is a sequence of a primer used to amplify the repeat region 
of the Fragile X gene, FMR1 . 

SEQ ID No. 7 is a sequence of a primer used to amplify the repeat 
region of the Fragile X gene, FMR1. 

SEQ ID No. 8 is a partial DNA sequence of TFR, the human transferrin
receptor.

DETAILED DESCRIPTION OF THE INVENTION 

Prior to setting forth the invention, it will be helpful to an understanding
of the invention to define certain terms that are used hereinafter.

Nucleotides are depicted according to their recognized abbreviations,
that is, "A" refers to adenine, "C" refers to cytosine, "G" refers to guanine, "T" refers to
thymine, and "N" refers to either A or C or T or G.

As used herein, "amplification" refers to the increase in the number of
copies of a particular nucleic acid fragment resulting from an enzymatic reaction, such
as polymerase chain reaction, ligase chain reaction, or the like.

As used herein, "oligonucleotide" refers to a nucleic acid molecule
comprising two or more deoxyribonucleotides or ribonucleotides, and preferably more
than three. An oligonucleotide may be synthesized or produced by amplification or
cloning.

As used herein, "polymerase chain reaction" or "PCR" refers to a
particular process of amplification for the exponential amplification of a specific DNA
fragment by utilizing two oligonucleotide primers that hybridize to opposite strands and
flank the region of interest in a target DNA. The process consists of a repetitive series
of cycles involving template denaturation, primer annealing, and the extension of the
annealed primers by a DNA polymerase, such as the thermostable Taq DNA
polymerase.

As used herein, a "primer" is an oligonucleotide that anneals to one
strand of a nucleic acid template and allows elongation of a complementary strand by a
polymerase.
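
The exponential character of PCR described above can be illustrated with a small idealized calculation. This sketch is not taken from the specification: the function name `pcr_copies` and the per-cycle `efficiency` parameter are assumptions introduced for illustration, with an efficiency of 1.0 corresponding to perfect doubling in every cycle.

```python
def pcr_copies(initial_copies: float, cycles: int, efficiency: float = 1.0) -> float:
    """Idealized copy number after a given number of PCR cycles.

    Assumption (not from the source): every cycle multiplies the number of
    copies by (1 + efficiency), where efficiency lies between 0 and 1.
    """
    if initial_copies < 0:
        raise ValueError("initial_copies must be non-negative")
    if cycles < 0:
        raise ValueError("cycles must be non-negative")
    if not 0.0 <= efficiency <= 1.0:
        raise ValueError("efficiency must be between 0 and 1")
    return initial_copies * (1.0 + efficiency) ** cycles


if __name__ == "__main__":
    # One template molecule, 25 cycles (the cycle count used in Example 2).
    print(pcr_copies(1, 25))
    # The same reaction at a hypothetical 80% per-cycle efficiency.
    print(pcr_copies(1, 25, efficiency=0.8))
```

Real reactions fall short of this ideal as reagents are depleted, so the result is best read as an upper bound rather than a prediction.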

As used herein, "template" refers to a sequence of nucleotides that can be 
copied by a DNA polymerase from a primer bound to the template. 

As used herein, a "zwitterion" refers to a molecule that is a dipolar ion.
At the isoelectric point (pI) of the molecule, it will have no net charge; at a pH range between
approximately the pKa and the pKb, the molecule bears little net charge.

I. Amplification of nucleic acid templates 

Amplification using thermostable enzymes generally involves multiple
cycles of denaturation, annealing, and synthesis performed at different temperatures.
An amplification reaction typically contains either heterogeneous or homogeneous
template nucleic acid, one or more oligonucleotide primers, nucleotides, various buffers
and cofactors, and DNA polymerase. To amplify heterogeneous G+C rich templates, a
zwitterion and compound that disrupts base pairing are added to the mix; if the template
is homogeneous (e.g., plasmid clone DNA), either a zwitterion or compound is added.

Protocols vary somewhat in concentrations of reagents, temperatures,
and incubation times, depending upon the apparatus, length of amplified product,
enzyme used, and other parameters. Protocols are readily available (see, for example,
manufacturer's instructions; Ausubel et al., Current Protocols in Molecular Biology,
Greene Publishing, 1995; and Examples herein). Generally, a "hot-start" procedure is
used, so that all reagents, save one, such as nucleotides, are mixed, and the temperature
is raised to at least annealing temperature before the saved reagent is added.
Amplification reactions may be analyzed by any of a variety of known methods. For
example, the reaction may be electrophoresed in a polyacrylamide or agarose gel. The
amplified material may be detected by staining with a dye, such as ethidium bromide, or
by autoradiography if a radioactive label is incorporated.



II. Zwitterions and compounds that disrupt base pairing 

As noted above, the present invention provides compositions comprising
a zwitterion or dipolar ion in combination with a compound that disrupts base pairing to
enhance and improve amplification of nucleic acids that are G+C rich. Typically,
examples of G+C rich regions that show improvement of amplification in the present
invention contain about 60% G+C over greater than 50 nucleotides, about 80% G+C in
a region of up to 500 nucleotides, and 100% G+C in up to about 200 nucleotides.
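
The G+C thresholds above can be checked against a candidate sequence with a simple sliding-window count. The following is only a sketch: the function names `gc_fraction` and `richest_window` are hypothetical and do not appear in the specification, and the choice of window length is left to the caller.

```python
def _clean(seq: str) -> str:
    """Upper-case a sequence and drop the spaces used in sequence listings."""
    return "".join(seq.upper().split())


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the whole sequence."""
    seq = _clean(seq)
    if not seq:
        raise ValueError("sequence is empty")
    return sum(base in "GC" for base in seq) / len(seq)


def richest_window(seq: str, window: int) -> float:
    """Highest G+C fraction found in any contiguous window of the given length."""
    seq = _clean(seq)
    if window <= 0 or window > len(seq):
        raise ValueError("window must be between 1 and the sequence length")
    gc = sum(base in "GC" for base in seq[:window])
    best = gc
    # Slide the window one base at a time, updating the running G+C count.
    for i in range(window, len(seq)):
        gc += (seq[i] in "GC") - (seq[i - window] in "GC")
        best = max(best, gc)
    return best / window


if __name__ == "__main__":
    primer_c = "GCTCAGCTCC GTTTCGGTTT CACTTCCGGT"  # SEQ ID No. 6
    print(f"primer c G+C fraction: {gc_fraction(primer_c):.2f}")
    # A CGG triplet repeat is 100% G+C over its whole length.
    print(richest_window("CGG" * 29, 51))
```

For example, a region would meet the "about 60% G+C over greater than 50 nucleotides" description if `richest_window(seq, 51)` returned roughly 0.6 or more.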

A zwitterion contains both an acidic group and a basic group. The acidic
group is associated and the basic group is dissociated at pH values between about the
pKa (the pH at which the acidic group is half associated and half dissociated) and about
the pKb (the pH at which the basic group is half associated and half dissociated).
Dipolar ions are encountered, for example, whenever a molecule contains both an
amino group and an acid group, providing that the amine is more basic than the anion of
the acid. For example, all amino acids exist in a zwitterionic form between about pH 4
and about pH 9. Over this pH range, the amino acids bear little net charge, and at the
isoelectric point, pI, bear no net charge.

The zwitterions for use in the present invention must have a basic group
containing at least one alkyl group, which is preferably a methyl group. Preferably the
basic group is an amine. A general formula for the zwitterion is thus:

(R)3N+-(CH2)i-COO-

wherein R is H, CH3, CH2-CH3, or (CH2)j-CH3, at least one R is not H, i is from 2 to
about 6, and j is from 2 to about 6. Preferred zwitterions are betaine (trimethyl
glycine), D-carnitine, dimethyl glycine, and monomethyl glycine.







For use in the present invention, the zwitterion should not inhibit DNA 
polymerases. It is preferable that the DNA polymerase retain at least 90% of activity at 
2.5 M zwitterion concentration. The effect of the zwitterion on nucleic acid molecules 
is a decrease in the Tm of GC base pairs and an increase in the Tm of AT base pairs. 
The decrease or increase is preferably at least 2°C, at least 3°C, at least 4°C, and most 
preferably at least 5°C. 

Compounds that disrupt base pairing (e.g., disrupt hydrogen bonding)
include DMSO, formamide, sodium perchlorate, glyoxal, and the like. Such
compounds may be identified by their ability to lower the melting temperature of a
nucleic acid duplex. Such compounds should not inhibit DNA polymerases. It is
preferable that the DNA polymerase retain at least 90% of activity at a 10% compound
concentration.

A zwitterion or a compound that disrupts base pairing can be tested for
utility in the context of the present invention by demonstration of at least a two-fold
increase in the amount of amplified product from a homogeneous template of at least
80% G+C, in the presence of the zwitterion or compound as compared to the absence of
the zwitterion or compound. When heterogeneous templates (i.e., multiple templates)
are to be amplified, the zwitterion and compound that disrupts base pairing are tested in
combination. The zwitterion and compound are added in amounts sufficient to cause an
increase in efficiency of amplification. In general, the zwitterion is added from about
0.1 M to about 3 M, from about 0.5 M to about 4.5 M, or from about 1 M to about 2.5
M. In general, the compound is added from about 5% to about 15%, or preferably from
about 5% to about 10%.
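
Turning the final concentrations above into pipetting volumes is a plain dilution calculation (final concentration times reaction volume, divided by stock concentration). The sketch below makes assumptions that are not in the specification: a 5 M betaine stock, neat (100% v/v) DMSO, and the hypothetical function names `stock_volume_ul` and `additive_plan`. The 50 µl default matches the reaction volume used in the Examples.

```python
def stock_volume_ul(final_conc: float, stock_conc: float, reaction_ul: float) -> float:
    """Volume of stock (in µl) that gives final_conc in a reaction of reaction_ul.

    final_conc and stock_conc must be in the same unit (e.g. both M or both % v/v).
    """
    if stock_conc <= 0 or reaction_ul <= 0 or final_conc < 0:
        raise ValueError("concentrations and volume must be positive")
    if final_conc > stock_conc:
        raise ValueError("final concentration cannot exceed the stock concentration")
    return final_conc * reaction_ul / stock_conc


def additive_plan(betaine_molar: float, dmso_percent: float,
                  reaction_ul: float = 50.0,
                  betaine_stock_molar: float = 5.0,    # assumed stock, not from the source
                  dmso_stock_percent: float = 100.0):  # assumed neat DMSO
    """Volumes of betaine stock and DMSO, and the volume left for everything else."""
    betaine_ul = stock_volume_ul(betaine_molar, betaine_stock_molar, reaction_ul)
    dmso_ul = stock_volume_ul(dmso_percent, dmso_stock_percent, reaction_ul)
    remaining_ul = reaction_ul - betaine_ul - dmso_ul
    if remaining_ul < 0:
        raise ValueError("additives alone exceed the reaction volume")
    return {"betaine_ul": betaine_ul, "dmso_ul": dmso_ul, "remaining_ul": remaining_ul}


if __name__ == "__main__":
    # 1.0 M betaine with 7% DMSO in a 50 µl reaction.
    print(additive_plan(1.0, 7.0))
```

The `remaining_ul` value shows how much volume is left for buffer, primers, template, nucleotides and enzyme, which becomes the limiting factor at the upper end of the betaine range.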



III. Two enzyme amplification systems

As noted above, an improvement in amplification is achieved when
amplification is performed in a two-enzyme system.

Two-enzyme systems are used for amplification of long fragments. In
general, one of the enzymes has a proofreading activity (3' → 5' exonuclease) that
corrects nucleotide misincorporations that may otherwise prematurely terminate DNA
synthesis, and the other enzyme is highly processive. Preferably, one or both of the
enzymes lack 5' → 3' exonuclease activity. Also, preferably, the enzymes are
thermostable. Examples of combinations of enzymes that are commonly used are
Klentaq1 plus Pfu DNA polymerases (1:15 v/v); rTth plus Vent® (isolated from
Thermococcus litoralis) DNA polymerases (Perkin-Elmer, Foster City, CA); Taq and
Pyrococcus DNA polymerases (Life Technologies, Gaithersburg, MD); and Taq and
Pwo DNA polymerases (Boehringer-Mannheim, Indianapolis, IN). Other combinations
may be used.

IV. Application of amplification of G+C rich templates 

As described herein, the present invention provides compositions and
methods for uniform amplification of nucleic acid templates regardless of G+C content.
Such amplification allows accurate quantitation, detection of G+C rich regions,
diagnosis of certain diseases, and improves construction of cDNA libraries, DNA
sequence analysis, and the like. For example, 5' ends of genes are G+C rich and are
often not amplified well during construction of a cDNA library. Some coding
sequences are G+C rich and will also be underrepresented in a cDNA or genomic
library. Without the present invention, if the starting material was a cDNA library with
G+C-rich cDNA present in medium abundance (<0.1%), upon amplification, this G+C-rich
nucleic acid would be less efficiently amplified and gradually lost. Thus, to find
the cDNA in a screen by hybridization, several million clones would likely need to be
examined. Using the present invention, the G+C rich cDNA should be present in the
amplified material at the same level as in the starting material. Following amplification
of cDNA from RNA isolated from a small number of cells, differential display
amplification can be performed.

Moreover, several genetic diseases are caused by a natural expansion of
a G+C-rich triplet. Such diseases include fragile X syndrome, spinobulbar muscular
atrophy, myotonic dystrophy, Huntington's chorea, and spinocerebellar ataxia type 1.

Furthermore, in addition to enhancing amplification, the present 
invention may improve the efficiency of nucleic acid synthesis in general, including 
reverse transcription, in vitro transcription, primer extension and the like. 

EXAMPLES 

EXAMPLE 1 

Amplification of G+C Rich Templates Using Taq DNA Polymerase
and Betaine or DMSO

A number of test DNAs are utilized in amplification reactions. Four of
the test fragments are:
(i) a 2.7 kb cDNA encoding the octamer binding protein, OCT1
(GenBank accession No. X13403; Sturm et al., Genes & Dev. 2:1582, 1988; SEQ ID
No. 8), which has a 52% G+C content;

(ii) a 1.6 kb fragment from the MHC region, MHCclnl, which has a 64%
G+C content (see Shukla et al., Nucleic Acids Res. 19:4233, 1991; GenBank Accession
No. L20433; SEQ ID No. 3, which presents nucleotides 320 to 850 of L20433);

(iii) a 0.8 kb fragment of human OCT-T1 cDNA, which encodes a
transcription factor expressed in T lymphocytes that binds the octamer sequence, and
which has an 80% G+C content (see Bhargava et al., Proc. Natl. Acad. Sci. USA
91:10260, 1993; SEQ ID No. 4); and

(iv) a 0.72 kb fragment of human transferrin receptor, TFR, which has a
44% G+C content (Nature 311:675, 1984; GenBank Accession No. X01060; SEQ ID
No. 5, which presents nucleotides 352 to 866 of X01060).

As each of these fragments is cloned in pBluescript or a pUC plasmid
(MHCclnl), primers for amplification of these fragments are the M13 forward primer
(CGCCAGGGTTTTCCCAGTCACGAC; SEQ ID No. 1) and the M13 reverse primer
(AGCGGATAACAATTTCACACAGGA; SEQ ID No. 2). Thus, the sizes of the
amplified fragments for each of these four templates, with the exception of MHCclnl,
include 200 bp of multiple cloning sites of pBluescript vector.

The two G+C-rich templates (MHCclnl and OCT-T1) are not successfully
amplified using AmpliTaq® DNA polymerase (Perkin-Elmer Cetus) under standard
conditions (see Science 239:487, 1988). The addition of DMSO (10% v/v), betaine
(up to 2.5 M), glycerol (10% v/v), formamide (10% v/v) and TMACl
(tetramethylammonium chloride) (0.1 mM to 1.0 M), or increasing annealing and
denaturation temperatures, does not result in visible amplification. In addition, betaine
(>1.0 M), formamide (>5%), and TMACl (>100 mM) are inhibitory for amplification of
the non-G+C-rich template OCT1.



EXAMPLE 2 

Amplification of G+C Rich Templates Using a Two Enzyme DNA Polymerase
System and Betaine or DMSO

Each of the G+C-rich templates is set up in an amplification reaction
using a two enzyme mixture, with and without the addition of betaine (0.5 to 2.5 M),
[...], formamide (5-10%), glycerol (5-10%), TMACl (0.1 mM to 1 M) and
tetramethylammonium acetate (TMAA) (0.1 mM - 1.0 M).

The amplification reaction (50 µl) is performed in 20 mM Tris-HCl,
pH 9.0 at [...], [...] sulfate, 2.5 mM magnesium chloride, 200 µM of each dNTP,
50 ng each of the M13 forward primer and the M13 reverse primer, 0.5-1.0 ng of
plasmid templates and 0.2 µl of Klentaq LA16 (a 1:15 v/v ratio of Klentaq1
(Ab Peptides Inc., St. Louis, MO), which is Taq DNA polymerase lacking the 5' to 3'
exonuclease activity, and [...]; Barnes, Proc. Natl. Acad. Sci. USA 91:2216, 1994).
[...] DMSO [...] without dNTPs and overlaid with mineral oil. Amplification is
performed in a thermal cycler instrument using thin-walled tubes (0.5 ml). The tubes
are heated at 95°C for 15 seconds and amplification is initiated by the addition of
dNTPs at 80°C. The following cycling conditions are used for 25 cycles: denaturation
at 94°C for 15 seconds, annealing at 50 to 55°C for 1 minute, and extension at 68°C
for 5 minutes. Following amplification, 10 µl of the resulting product is loaded onto
a 1% agarose gel (see Molecular Cloning: A Laboratory Manual by J. Sambrook et al.,
Cold Spring Harbor, 1989). The amplified DNA is stained with ethidium bromide and
the gel is photographed while UV-irradiated by a transilluminator.

The results are shown in Figures 1A and 1B. The individual numbered
lanes in Figures 1A and 1B contain the template and additives listed in the following
table.



TABLE 1

Lane | Template* | Additive      | Lane | Template | Additive
1    | OCT1      | 2.2 M betaine | 10   | all four | 10% DMSO
2    | OCT1      | 10% DMSO      | 11   | all four | 1.1 M betaine
3    | TFR       | 2.2 M betaine | 12   | all four | 2.2 M betaine
4    | TFR       | 10% DMSO      | 13   | all four | 5% DMSO + 1.1 M betaine
5    | MHCclnl   | 2.2 M betaine | 14   | all four | 5% DMSO + 2.2 M betaine
6    | MHCclnl   | 10% DMSO      | 15   | all four | 10% DMSO + 1.1 M betaine
7    | OCT-T1    | 2.2 M betaine | 16   | all four | 10% DMSO + 2.2 M betaine
8    | OCT-T1    | 10% DMSO      | 17   | all four | 7% DMSO + 1.0 M betaine
9    | all four  | 5% DMSO       | 18   | all four | 5% DMSO + 1.4 M betaine

* "all four" refers to the mixture of the 4 templates, OCT1, OCT-T1, TFR and MHCclnl.

As shown in Figure 1A, each of the templates could be amplified in the
presence of betaine or DMSO. Furthermore, DMSO and betaine do not have a negative
effect on amplification. In contrast, the addition of tetramethylammonium chloride
(TMACl), which is also a tetraalkylammonium ion, does not have a similar effect as
that found with betaine. Moreover, TMACl and TMAA (at 0.1-1.0 M) inhibit
amplification of OCT1. The addition of betaine and/or DMSO aids in the amplification
of G+C-rich templates (lanes 5-8, Figure 1A), while not having any adverse effect on
the amplification of the other templates (lanes 1-4, Figure 1A).

Amplification is also performed on a mixture of the four templates,
varying the concentrations of DMSO and/or betaine. As can be seen in Figure 1B, the
four templates are not amplified uniformly upon addition of either DMSO or betaine
alone (lanes 9-12). All the templates were amplified to a varying extent in the presence
of 2.2 M betaine (lane 12), 5% DMSO + 2.2 M betaine (lane 14), or 10% DMSO + 1.1 M
betaine (lane 15). When 7% DMSO with 1.0 M betaine and 5% DMSO in combination
with 1.4 M betaine (lanes 17 and 18) were tested, all templates were amplified
uniformly. Thus, by adding both DMSO and betaine, preferably within a range of 1.0
M betaine with 6-8% DMSO or 5% DMSO with 1.2-1.8 M betaine, all four templates
are amplified.

Thus, approximately equimolar amplification of all the templates
occurred at 1.0 M betaine + 7% DMSO (lane 17), and 5% DMSO + 1.4 M betaine (lane
18). A combination of 1.0 M betaine with 6-8% DMSO or 5% DMSO with 1.2-1.8 M
betaine resulted in similar uniform amplifications; accordingly, such concentrations are
preferred. Although at a higher concentration of DMSO (12-15%) or betaine (1.8-2.5
M) all the templates showed variable amplification, approximately equimolar yields of
the individual components of the template mixtures were obtained when DMSO and
betaine were within the preferred concentration range and were combined with the LA
system.
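
The two preferred combinations stated above can be written as a simple membership check. This is a sketch under a literal reading of the text (1.0 M betaine with 6-8% DMSO, or 5% DMSO with 1.2-1.8 M betaine); the function name `in_preferred_range` and the numerical tolerance `tol` are assumptions introduced for illustration only.

```python
import math


def in_preferred_range(betaine_molar: float, dmso_percent: float, tol: float = 0.05) -> bool:
    """True if the pair falls in either preferred combination, read literally.

    Combination A: 1.0 M betaine with 6-8% DMSO.
    Combination B: 5% DMSO with 1.2-1.8 M betaine.
    `tol` is a hypothetical tolerance for matching the fixed value in each pair.
    """
    combo_a = (math.isclose(betaine_molar, 1.0, abs_tol=tol)
               and 6.0 <= dmso_percent <= 8.0)
    combo_b = (math.isclose(dmso_percent, 5.0, abs_tol=tol)
               and 1.2 <= betaine_molar <= 1.8)
    return combo_a or combo_b


if __name__ == "__main__":
    # (betaine in M, DMSO in %) for selected mixed-template lanes of Table 1.
    lanes = {13: (1.1, 5.0), 14: (2.2, 5.0), 16: (2.2, 10.0), 17: (1.0, 7.0), 18: (1.4, 5.0)}
    for lane, (betaine, dmso) in lanes.items():
        print(lane, in_preferred_range(betaine, dmso))
```

Under this reading only lanes 17 and 18 qualify, which agrees with the lanes reported as uniformly amplified.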
EXAMPLE 3

Amplification of Templates using Two Enzyme DNA Polymerase Systems and 

Betaine or DMSO 

Various single and two enzyme systems were used to amplify OCT-T1 
(80% G+C) or TFR (44% G+C) templates. Amplification reactions were performed as 
above in the presence of 2 M betaine. The results are presented below. 



Table 2

Enzyme System                       | Enzymes                              | OCT-T1 (80% G+C) | TFR (44% G+C) | Sensitivity to Betaine
LA-16                               | Klentaq1 and Pfu Polymerase          | +++              | +++           |
AmpliTaq® DNA Polymerase            | rTaq DNA Polymerase                  | -/+              | +++           |
XL PCR (Perkin-Elmer)               | rTth and T. litoralis DNA Polymerase | +++              | +++           |
Elongase (BRL)                      | Taq and Pyrococcus DNA Polymerase    | +++              | +++           |
Expand System (Boehringer Mannheim) | Taq and Pwo DNA Polymerase           | ++               | +++           |
Vent® DNA Polymerase (NE Biolabs)   | T. litoralis Polymerase              |                  | +++           | +
Deep Vent™ Polymerase (NE Biolabs)  | Pyrococcus Polymerase                |                  |               | +

As shown above, all enzymes, except Pyrococcus polymerase, amplified
TFR, a template with average G+C content. However, only the two enzyme systems
were able to amplify OCT-T1, a template with high G+C content.
EXAMPLE 4

Amplification of Templates using Structural Analogues of Betaine

Various analogues of betaine and two tetraalkylammonium salts are used
to amplify OCT-T1 or TFR templates. Amplification reactions are performed as above
in the presence of either 2 M analogue or at least 0.15 M salt. Results are presented in
the Table below.



Table 3

Compound                 | OCT-T1 | TFR
Monomethyl glycine (2 M) | -/+    | +++
Dimethyl glycine (2 M)   | +      | +++
Glycine betaine (2 M)    | +++    | +++
D-carnitine (2 M)        | +++    | +++
TMAC (>0.15 M)           |        |
TMAA (>0.15 M)           |        |

The structure of the compounds is shown below.

Table 4

Glycine betaine      (CH3)3N+-CH2-COO-
D-carnitine          -OOC-CH2-CH(OH)-CH2-N+(CH3)3
Monomethyl glycine   CH3-H2N+-CH2-COO-
Dimethyl glycine     (CH3)2-NH+-CH2-COO-
TMACl                (CH3)4N+ Cl-
TMAA                 (CH3)4N+ CH3COO-



As shown in the table, TMACl and TMAA are unable to facilitate
amplification of either template and most likely inhibit the enzyme. However, the
zwitterions facilitated amplification of both templates, the quaternary ammonium ions
working best.

EXAMPLE 5

Amplification of G+C Rich Repeats of the Fragile X Gene

Amplification of the G+C-rich repeat expansion involved in Fragile X
Syndrome is performed. Fragile X Syndrome is one of the most common causes of
mental retardation, affecting 1 in 1,500 males and 1 in 2,500 females. The fragile X site
has been mapped to Xq27.3, and shown to be due to the expansion of CGG triplet
repeats in the 5'-end of the FMR1 gene. Normal subjects have 2-50 CGG repeats with an
average of 29 repeats, those with premutation have about 50-200 repeats, and
individuals with disease have more than 200 repeats (i.e., a full mutation) (see
Nussbaum and Ledbetter, Ann. Rev. Genetics 20:109, 1986; Verkerk et al., Cell 65:905,
1991; and Kremer et al., Science 252:1711, 1991). Diagnostic confirmation of the
disease has primarily been accomplished by cytogenetic analysis of the cells and
Southern blotting of the DNA from these patients (see Sutherland, Am. J. Hum. Genet.
31:125, 1979; and Rousseau et al., N. Engl. J. Med. 325:1673, 1991). The sequence of
the fragile X CGG repeat region may be found in GenBank accession No. X61378.gb-pr,
and encompasses nucleotides 2599 to 2868.

The methods described herein are used to amplify the G+C-rich fragile X
CGG triplet repeat region (85% G+C) directly from genomic DNA of a cell line with a
normal FMR1 gene, JY (29 CGG repeats), genomic DNA from 2 males with
premutations (74 and 119 CGG repeats, respectively), and genomic DNA from 2
patients carrying full mutations (>1000 CGG repeats). The sizes of the CGG repeats
were obtained by Southern blotting and hybridization with specific probes.

Amplification is performed as described above in the presence of 10%
DMSO + 1.1 M betaine or 5% DMSO + 2.0 M betaine. The sequences of the primers used
are:

primer c: GCTCAGCTCC GTTTCGGTTT CACTTCCGGT (SEQ ID No. 6)
primer f: AGCCCCGCAC TTCCACCACC AGCTCCTCCA (SEQ ID No. 7)



Either 10 or 100 ng of genomic DNA from each sample is mixed in
20 mM Tris-Cl (pH 9.0) containing 150 µg/ml bovine serum albumin, 16 mM
ammonium sulfate, 2.5 mM magnesium chloride, and 50 ng of each primer. All the
components, except dNTPs, are denatured at 94°C for 15 sec, and dNTPs are added at
80°C. Amplification proceeded using the following cycling conditions for 40 cycles:
denaturation at 94°C for 15 sec, annealing at 65°C for 1 minute, and extension at 68°C
for 5 min (or annealing and extension was alternatively done at 68°C for 5 min).
Following amplification, 10 µl of the reaction was loaded on a 2% agarose gel and the
bands visualized by ethidium bromide staining.
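
The cycling conditions above can be written down as data, which also gives a quick estimate of how long the programmed holds take. The sketch below is illustrative only: the `Step` class, the `program_seconds` function and the constant name are hypothetical, and the estimate ignores ramp times between temperatures and the initial hot-start step.

```python
from dataclasses import dataclass
from typing import Sequence


@dataclass(frozen=True)
class Step:
    """One temperature hold within a PCR cycle."""
    name: str
    temp_c: float
    seconds: int


def program_seconds(steps: Sequence[Step], cycles: int) -> int:
    """Total programmed hold time in seconds (ramp times are not included)."""
    if cycles < 0:
        raise ValueError("cycles must be non-negative")
    return cycles * sum(step.seconds for step in steps)


# Cycle used for the fragile X repeat region in this Example.
FRAGILE_X_CYCLE = [
    Step("denaturation", 94.0, 15),
    Step("annealing", 65.0, 60),
    Step("extension", 68.0, 300),
]

if __name__ == "__main__":
    total = program_seconds(FRAGILE_X_CYCLE, cycles=40)
    print(f"{total} s = {total / 60:.0f} min of programmed holds")
```

Most of the run time comes from the 5-minute extension step, which is typical of long-fragment protocols.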

The results are depicted in Figure 2, in which lane 1 is JY DNA (10 ng);
lane 2 is JY DNA (100 ng); lane 3 is 74 CGG repeat containing DNA; lane 4 is 119
CGG repeats containing DNA; lane 5 is DNA containing heterozygous full mutation in
a female patient; and lane 6 is a full mutation from a male DNA sample.

An approximately 300 bp PCR product is observed for JY (Figure 2,
lanes 1 and 2), and the expected size fragments for 74 repeats (440 bp) and 119 repeats
(575 bp) are observed as well (Figure 2, lanes 3 and 4). However, no specific band is
observed for the DNA from a male patient carrying a full mutation (Figure 2, lane 5),
and only a normal band (300 bp) is observed for DNA from a female patient carrying a
full mutation as well as a normal gene (Figure 2, lane 6). Thus, the normal sized repeat
(with an average repeat number of 29) and premutation expansion can be visibly
amplified, whereas the full mutation (>200 repeats) is not visibly amplified.
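
The relationship between repeat number, allele class and band size can be sketched in a few lines. Two assumptions are made here that are not stated in the specification: the flanking length of 213 bp is back-calculated from the roughly 300 bp band reported for 29 repeats (300 - 3 x 29), and an allele of exactly 50 repeats is treated as normal. The function names are hypothetical.

```python
FLANK_BP = 213  # assumed: 300 bp normal band minus 29 repeats x 3 bp


def classify_repeats(cgg_repeats: int) -> str:
    """Allele class from the CGG repeat count, using the ranges given in the text."""
    if cgg_repeats < 0:
        raise ValueError("repeat count must be non-negative")
    if cgg_repeats <= 50:        # boundary handling at 50 is an assumption
        return "normal"
    if cgg_repeats <= 200:
        return "premutation"
    return "full mutation"


def expected_product_bp(cgg_repeats: int, flank_bp: int = FLANK_BP) -> int:
    """Approximate PCR product size: flanking sequence plus 3 bp per repeat."""
    if cgg_repeats < 0:
        raise ValueError("repeat count must be non-negative")
    return flank_bp + 3 * cgg_repeats


if __name__ == "__main__":
    for repeats in (29, 74, 119, 1000):
        print(repeats, classify_repeats(repeats), expected_product_bp(repeats))
```

With this flank estimate the 74- and 119-repeat alleles come out at 435 bp and 570 bp, within a few base pairs of the 440 bp and 575 bp bands reported above.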

EXAMPLE 6

Amplification of cDNAs using Betaine or DMSO

First strand cDNAs synthesized from total RNA isolated from a small
number of cells and tissues are amplified. Either Jurkat T cells, induced for 4 hrs with
50 nM phorbol 12-myristate 13-acetate and 2 ng/ml phytohemagglutinin, or human
inner ear samples, obtained during a surgical procedure, were used. Total RNA was
isolated, and double-stranded cDNA was made. An adaptor containing a NotI site was
ligated to the cDNAs. Amplification reactions are performed as described above using
Klentaq LA16, using the adaptor as primer in the presence of 1.5 M betaine and 7%
DMSO, except that each cycle consisted of denaturation at 95°C for 15 sec, annealing at
55°C for 1 min, and extension at 68°C for 5 min. A second round of amplification was
performed on 1 µl of reaction mix using the cycle conditions: denaturation at 95°C for
15 sec, annealing at 60°C for 1 min, and extension at 68°C for 5 min.

As shown in Figure 3, uniform amplification occurred using cDNA from
tissues (Ear1 and Ear2) as well as from cell lines (JTP, J1000, J5000). Moreover, even
when small numbers of cells are used to extract total RNA, amplification is substantial
(lanes J1000, J5000).



It will be appreciated that, although specific embodiments of the
invention have been described herein for purposes of illustration, various modifications
may be made without departing from the spirit and scope of the invention.
Accordingly, the invention is not limited except as by the appended claims.
SEQUENCE LISTING

(1) GENERAL INFORMATION:

(i) APPLICANTS: Sherman M. Weissman
Namadev Baskaran

(ii) TITLE OF INVENTION: AMPLIFICATION OF NUCLEIC ACIDS

(iii) NUMBER OF SEQUENCES: 8

(iv) CORRESPONDENCE ADDRESS:
(A) ADDRESSEE: SEED and BERRY LLP
(B) STREET: 701 Fifth Avenue, 6300 Columbia Center
(C) CITY: Seattle
(D) STATE: Washington
(E) COUNTRY: USA
(F) ZIP: 98104

(v) COMPUTER READABLE FORM:
(A) MEDIUM TYPE: Floppy disk
(B) COMPUTER: IBM PC compatible
(C) OPERATING SYSTEM: PC-DOS/MS-DOS
(D) SOFTWARE: PatentIn Release #1.0, Version #1.30

(vi) CURRENT APPLICATION DATA:
(A) APPLICATION NUMBER:
(B) FILING DATE: 29-NOV-1996
(C) CLASSIFICATION:

(viii) ATTORNEY/AGENT INFORMATION:
(A) NAME: Nottenburg Ph.D., Carol
(B) REGISTRATION NUMBER: 39,317
(C) REFERENCE/DOCKET NUMBER: 390036.402C1

(ix) TELECOMMUNICATION INFORMATION:
(A) TELEPHONE: (206) 622-4900
(B) TELEFAX: (206) 682-6031



(2) INFORMATION FOR SEQ ID NO:1:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 24 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1:
CGCCAGGGTT TTCCCAGTCA CGAC

(2) INFORMATION FOR SEQ ID NO:2:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 24 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:2:
AGCGGATAAC AATTTCACAC AGGA 24

(2) INFORMATION FOR SEQ ID NO:3:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1652 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:3:



GGATCCGGCC CCACGGAGGT CCCCATCTCC CTCAAGATTC TCAGATTCAT CCCCAATGAG 60

TGGTGTAGCC CCTACAGGGG TGTCAGCCCC CCTCATCACC AACAGTGACA GTGACAGAGG 120

CTGGAGATGA GGGGCCAGCA GGCTCCAGGG AGTCGGGGGT GGCCTTGGGC AGGGTTTCTT 180

CACTACGAGG GGTGTTCCCC AAAGAGCCAT GAACTGTAGA GGAAGAGAAA AAGTTCAGAG 240

CTAAGGGCTC AGGAGATCCT GTGTATTTAG GGAAGGTGAC GGTCCAATTG GGGCCCGTTT 300

TAGCTGCACT CACCTCTCTC GGTGGCTCCT CTGGTTTCCT TCTCCAGCAG CTCCCCCATC 360

TCAGCGGGGG CCATCCCCCT GGGAGGGGAG ACAAGGGACA GGAGGGCTGG TCAGCCCAGT 420

AGAGAGTTGG GGGGTCCAGG ATGCCTGGGC CCTGGGAAGA GAGAGTAGGC TCCGGGGCCT 480

ACCTCTTCCT CTGGCCCTTC CGCGGCCTCG GCTGCCCGGA GCCGCACAAC CCTCCCCGGG 540

CCGCATAATC CCTCCTTGAT GACCCTCCCT CTCGGTAGTA CCCGCACTCT GGGGCCGAGA 600

GAAGAGGAGG GGGCACGGAC TCTTGGGGGG GGCCTCCGAG CCCGGCCCCG CCCCTCTCCC 660

CGGCTGCACG CGCCGATACC CTTTGTACCC AGGCGCGGGA CCCGGACAAT CCTCAGATCC 720

TCCAGCACCC GCTGCCCCCC AGCCCGGTGG ACGGCCCCTC GTGCCCCTCA CGCGTGCTCC 780

TGGGGCCCCG GCGCCCGTCG CCCAGTGCGG GCAGGCCGGC GGCTGCACGC GCGCCTCCGT 840 

GCCCACTCCC CCCACCTCCC ACACCCTGGT CCCCTCATCC GCCCCCGGTG CTGGCCCCCT 900 

GGATTGCTGC AAGTCCCGCC CGGCCCCCGG CCCCGTTGCA CCCCCGGAGC ATTGCACGGC 960 

GCTTCCCCCG GGGGCGCGCG CGGGCATGCA CCCGCCTCTC CCCCTCCCTT CCGCACCTCG 1020 

GCGGCCGCCG CCGCTGCAGC TCCCGCCGCC GCCGCCATCG CGCTTGCGCT GGGGGCCGAG 1080 

CCGGCGCGCG GCCGCCCCGG GTCACGTGGG CGAGGGAGGG AGGGCGAGGA GGAGCCTTAA 1140 

AGGAGCCGCT ACATGCTTTT TGGCCATTTT CCCCTGAGAG CGGCCTCGGA GATGGCTGTG 1200 

ACTGTCCTAA GCTGGGAGCT GCAAGGGAGA ATTCCTGTCA TTCCTGGCCT CAGTTCTGCA 1260 

GGGACCGAGG GCGAGACACG CCTGGGCCCA GGTGTGGCGT CTCTGTCCCC ATCTGGTTTT 1320 

AGGTAACAAG CGGACGTTCT GAACTTCTCG GCTCTCGGCA GCGGCTGTAT TTCCTCTGGC 1380 

CTGGTTGGGC TTTTCCCGCC TCTGGTTGCT TTTCTGCCTT TCTAGTTTTT GGGTTACCAG 1440 

ATAGAAGGCT TGGCCTCAGT TTTGGCCTCG CCTTTTTGCT CTTTCTAACG AGCACGAAGG 1500 

GGCGATAGGG ACGCGGAGGA CACCTTTATT CTTGGCTGGT TCTAGCATGC TGCTTCATGT 1560 

CCCCTGGAGC AGCGTGCCCT TCTGAAAACC TGTGGCTAAA TGTCTCTTCT GTTTATATCT 1620 

GGCGTGTTAC ACCTTCACAC GCACTAGGAT CC 1652 
(2) INFORMATION FOR SEQ ID NO:4:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 530 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:4:

CGAGGCCATC CGGCGGGCCT GCCTGCCCAC GCCGCCGCTG CAGAGCAACC TCTTCGCCAG 60

CCTGGACGAG ACGCTGCTGG CGCGGGCCGA GGCGCTGGCG GCCGTGGACA TCGCCGTGTC 120

CCAGGGCAAG AGCCATCCTT TCAAGCCGGA CGCCACGTAC CACACGATGA ACAGCGTGCC 180

GTGCACGTCC ACTTCCACGG TGCCTCTGGC GCACCACCAC CACCACCACC ACCACCACCA 240

GGCGCTCGAA CCCGGCGATC TGCTGGACCA CATCTCCTCG CCGTCGCTCG CGCTCATGGC 300

CGGCGCGGGC GGCGCGGGCG CGGCGGCCGG CGGCGGCGGC GCCCACGACG GCCCGGGGGG 360

CGGTGGCGGC CCGGGCGGCG GCGGCGGCCC GGGCGGCGGC GGCCCCGGGG GAGGCGGCGG 420

TGGCGGCCCG GGGGGCGGCG GCGGCGGCCC GGGCGGCGGG CTCCTGGGCG GCTCCGCGCA 480

CCCTCACCCG CATATGCACA GCCTGGGCCA CCTGTCGCAC CCCGCGGCGG 530

(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 515 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:5: 



ATGGCGATAA CAGTCATGTG GAGATGAAAC TTGCTGTAGA TGAAGAAGAA AATGCTGACA 60

ATAACACAAA GGCCAATGTC ACAAAACCAA AAAGGTGTAG TGGAAGTATC TGCTATGGGA 120

CTATTGCTGT GATCGTCTTT TTCTTGATTG GATTTATGAT TGGCTACTTG GGCTATTGTA 180

AAGGGGTAGA ACCAAAAACT GAGTGTGAGA GACTGGCAGG AACCGAGTCT CCAGTGAGGG 240

AGGAGCCAGG AGAGGACTTC CCTGCAGCAC GTCGCTTATA TTGGGATGAC CTGAAGAGAA 300

AGTTGTCGGA GAAACTGGAC AGCACAGACT TCACCAGCAC CATCAAGCTG CTGAATGAAA 360

ATTCATATGT CCCTCGTGAG GCTGGATCTC AAAAAGATGA AAATCTTGCG TTGTATGTTG 420

AAAATCAATT TCGTGAATTT AAACTCAGCA AAGTCTGGCG TGATCAACAT TTTGTTAAGA 480

TTCAGGTCAA AGACAGCGCT CAAAACTCGG TGATC 515



(2) INFORMATION FOR SEQ ID NO:6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:6:
GCTCAGCTCC GTTTCGGTTT CACTTCCGGT 
(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 
AGCCCCGCAC TTCCACCACC AGCTCCTCCA 
(2) INFORMATION FOR SEQ ID NO:8:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2584 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:8:



GAGGAGCAGC GAGTCAAGAT GAGAGTTCAG CCGCGGCGGC AGCAGCAGCA GACTCAAGAA 60

TGAACAATCC GTCAGAAACC AGTAAACCAT CTATGGAGAG TGGAGATGGC AACACAGGCA 120

CACAAACCAA TGGTCTGGAC TTTCAGAAGC AGCCTGTGCC TGTAGGAGGA GCAATCTCAA 180

CAGCCCAGGC GCAGGCTTTC CTTGGACATC TCCATCAGGT CCAACTCGCT GGAACAAGTT 240

TACAGGCTGC TGCTCAGTCT TTAAATGTAC AGTCTAAATC TAATGAAGAA TCGGGGGATT 300

CGCAGCAGCC AAGCCAGCCT TCCCAGCAGC CTTCAGTGCA GGCAGCCATT CCCCAGACCC 360

AGCTTATGCT AGCTGGAGGA CAGATAACTG GGCTTACTTT GACGCCTGCC CAGCAACAGT 420

TACTACTCCA GCAGGCACAG GCACAGGCAC AGCTGCTGGC TGCTGCAGTG CAGCAGCACT 480

CCGCCAGCCA GCAGCACAGT GCTGCTGGAG CCACCATCTC CGCCTCTGCT GCCACGCCCA 540

TGACGCAGAT CCCCCTGTCT CAGCCCATAC AGATCGCACA GGATCTTCAA CAACTGCAAC 600

AGCTTCAACA GCAGAATCTC AACCTGCAAC AGTTTGTGTT GGTGCATCCA ACCACCAATT 660
TGCAGCPAGP 


GPAGTTTATP 


ATPTPAPAPA 
A \ L 1 LALAbA 


LbLLLLAbbb CCAGCAGGGT CTCCTGCAAG 


720 


CGCAAAATPT 


TPAAAPP.PAA 


PTAPPTPAPP 
L I ALL 1 LAbL 


AAAbLLAAbC CAACCTCCTA CAGTCGCAGC 


780 


CAAGPATPAP 


PPTPAPPTPP 


PAPPPAPPAA 
LAbLLAbLAA 


LLLLAALACb CACAATAGCA GCAACCCCAA 


840 


TTPAGAPAPT 


TPPAPAPAPP 


PAPTPAAPAP 
LAb 1 LAALAL 


PAAATrPA AT T^ATArTrpr a r~ t—t r> a 

LAAAbLbAAl IbATACTCCC AGCTTGGAGG 


900 


AGCCCAGTGA 


CPTTGAGGAG 


PTTPiAPPAPT 


TTPPPAAPAP rTTrAAAPA A APATPA ATPA 

I I bLLAAbAL L 1 1 LAAALAA AbALbAATCA 


960 


AACTTGGATT 


CACTCAGGGT 


GATGTTGGGP 

V-m lull vjVjvjV* 


TPftPTATftfin nAAAPTATAT PPAAATPATT 
i lul 1 A 1 bbb bAAAL 1 A 1 A 1 bbAAA 1 uAL 1 


1020 


TCAGCCAAAC 


TACCATCTCT 


CGATTTGAAG 


PPTTGAAPPT PAHPTTTAAft AAPATHTPPA 
i « uaall. 1 L-AbL 1 1 1 AAb AALA 1 b 1 bLA 


i fion 
lUoO 


AGTTGAAGCC 


AC 1 1 1 IAGAG 


AAGTGGPTAA 


ATGATPiPAHA HAAPPTPTPA TPTPATTPPT 
A I bA I ULAbA bAALL 1 L 1 LA 1 L i bA 1 I Lb 1 


1 1 Af\ 

1140 


CCCTCTCCAG 


CCCAAGTGPP 


PTGAATTPTP 


PAHftAATTfiA cn.ciCTTrts.rr rrT/\rr acta 
LAbbAA 1 i bA bbbL I 1 bAbL Lb 1 AbbAbbA 


1200 


AGAAACGCAf 


PAGPATAGAG 


APPAAPATPP 


CTcn'mrrTT apapaapapt ttpttpp a 
b 1 b 1 bbLL 1 1 AbAbAAbAb i 1 1 L II ubAbA 


1260 


ATCAAAAGCC 


TAPPTPGGAA 


GAGATPAPTA 


TPATTPPTPA TPAPPTPAAT ATPPAAaaap 
I bA 1 1 bL 1 bA 1 LAbL 1 LAA 1 A 1 bbAAAAAb 


1320 


AGGTGATTfG 


TGTTTGGTTP 

1 u t 1 1 OVJ 1 1 ^ 


TPTAAPPPPP 
1 b 1 AALLbLL 


PPPAPAAAPA aaaaapaatp AAprrArrA a 
bLLAbAAAbA AAAAAbAA 1 L AALLLAlLAA 


1380 


GCAGTGGTGG 


GAPPAftPAftP 


TPAPPTATTA 
1 LALL 1 A II A 


A APP A AT f T T PPfPAPPPPA ArTTPArTrr 
AAbLAAl 1 1 1 LLLLAbLLLA ALTTLACTbG 


1440 


TGGrGAffAr 


APPAAGPPTT 


ptpaptappa 

b 1 bAL 1 AULA 


PTPPAPPAAP TAPPPTPapa PTrArrrrTr 
b 1 bLAbLAAL 1 ALLL 1 LALA b \ LAbLLL 1 o 


i c r\ r\ 

1500 


TppTpppTpT 

1 1 LLL 1 L 1 


ftAPPAPTPPT 
bALLAb 1 bL 1 


PPTPTPAPPA 
bL 1 b 1 bALbA 


ATP'T T TP APT TATAPrrArT TPAPAPArrA 

A 1 L 1 1 1 LAb 1 1 ALAbbLALT TlAGALACCA 


1560 


PPTPPAAPAA 


PAPAPPAAPP 
L AL AbL AAL l 


rTr ATTTPPA 
b 1 bA 1 1 1 LLA 


c APcrrcTrr appttp^tpa ppaptp appt 
LAbLbLLlLL AbLnLLlLA bLAbTCACbT 


1620 


PPPPPTprpT 

V_*UL^U J.L 1 L 1 


ftAPTPPPTPP 
OnU I LLL 1 LL 


PPTTPTPPPT 
LL 1 1 L 1 LLL 1 


PAPPPTPPAP PTPPPAPPPA TPPAPTPPPA 

LAbLL 1 LLAL L 1 LLbAbbLA 1 LLAb 1 uLLA 


i con 

1680 


PiTHAftAPPAP 


PAPAAPAPAP 
L AL AAL AL Ab 


APPAPPTPPA 
ALLALL i LLA 


PTPPTTTPTP PTPPPPTPTT Cm ACT ACCC 
L 1 LL II 1 b 1 L LI LLLL 1 L 1 1 bbbALLAbLL 


1/40 


Abb 1 un 1 bb 1 


PAPAPPATPA 
bALAbLA 1 LA 


rp i i i pr AAA 
bb 1 l 1 bLAAA 


PAPPAPPAPP TPPTPPPPTT PAAPPAPPTP 
LAbLAbLAbL 1 bL 1 bLLL 1 1 LAAbbAbL 1 b 


i onn 
loOO 


PAPAPnTPPP 


APPAAATPPP 
AbL AAA 1 bLL 


APTPTTPPTP 
Ab 1 L 1 1 bL 1 b 


PPATPPPAPP TPPTPPAPPA PTAAAPTPAA 
LLA 1 bbLAbL 1 bL 1 bLAbbA L 1 AAALLLAA 




gpptgatggp 

uLL 1 Un 1 uuU 


APPPTPAPAP 
ALLL I LALAb 


1 t 1 bLbbL 1 b 


PAPPTPPPTT APTPAPTPTP AATPPAPPPA 
bAbb t bLL 1 1 AL 1 LAb 1 L 1 b AA 1 LLAbbbA 


iy^u 


ppptpapppp 

LLL 1 UMuLuU 


TPPTPTPAPP 
1 uU 1 L 1 LAbL 


PPAPPTPTAA 
LLAbL 1 L 1 AA 


TPAPPAAPAP TAPAPTPPPA APTATTr A AP 

1 bAbLAALAb 1 ALAL 1 bbLA AL 1 A 1 1 LAAb 


1980 


PTPTTPPTTP 


tpptppptpt 


PTTPPAATAA 
L 1 1 LLAA 1 AA 


PATPAPTTPA TPPAAPTPPP A APPTPPTAT 
LA 1 LAL 1 1 bA 1 bLAAL 1 bbb AALL 1 bb 1 A 1 


on/in 


TTGCCAATGC GGGAGGAGCC CCCAACATCG TGACTGCCCC TCTGTTCCTG AACCCTCAGA 2100

ACCTCTCTCT GCTCACCAGC AACCCTGTTA GCTTGGTCTC TGCCGCCGCA GCATCTGCAG 2160

GGAACTCTGC ACCTGTAGCC AGCCTTCACG CCACCTCCAC CTCTGCTGAG TCCATCCAGA 2220

ACTCTCTCTT CACAGTGGCC TCTGCCAGCG GGGCTGCGTC CACCACCACC ACCGCCTCCA 2280

AGGCACAGTG AGCTGGGCAG AGCTGGGCTG CCAGAAGCCT TTTTCACTCT GCAGTGTGAT 2340

TGGACTGCCA GCCAGGTTAA TAAACTGAAA AATGTGATTG GCTTCCTCTC GCCGTGTTGT 2400 

GAGGGCAAAG GAGAGAAGGG AGAAAAAAAA AAAAAAAACC ACACACACCC ATACACAATA 2460 

TACCAGAAAA GGAAGGAAGG ATGGAGACGG AACATTTGCC TAATTTGTAA TAAAACACTG 2520 

TCTTTTCAGG GTTGCTTCAT GGGTTGGAGG ACTTTCTAAC CAAAAATTAA AAAAAAAAAA 2580 

AAAA 2584 



